Efficient spoken dialogue control depending on the speech recognition rate and system's database
نویسندگان
چکیده
We present dialogue control methods (the dual-cost method and the trial dual-cost method) that enable a spoken dialogue system to convey information to the user in as short a dialogue as possible depending on the speech recognition rate and the content of its database. Both methods control a dialogue so as to minimize the sum of two costs: the confirmation cost (Ccost) and the information transfer cost (I-cost). The C-cost is the length of a subdialogue for confirming a user query, and the I-cost is the length of a system response generated after the confirmations. The dual-cost method can avoid the unnecessary confirmations that are inevitable in conventional methods. The trial dual-cost method is an improved version of the dualcost method. Whereas the dual-cost method has the limitation that it generates a system response based on only the content of a query that the user has acknowledged in the confirmation subdialogue, the trial dual-cost method does not. Dialogue experiments prove that the trial dual-cost method outperforms the dual-cost method and that both methods outperform conventional ones.
منابع مشابه
Linguistic and acoustic features depending on different situations - the experiments considering speech recognition rate
This paper presents the characteristic differences of linguistic and acoustic features observed in different spoken dialogue situations and with different dialogue partners: human-human vs. human-machine interactions. We compare the linguistic and acoustic features of the user’s speech to a spoken dialogue system and to a human operator in several goal setting and destination database searching...
متن کاملSpoken Dialogue Control Based on a Turn-minimization Criterion Depending on the Speech Recognition Accuracy
This paper proposes a new dialogue control method for spoken dialogue systems. The method configures a dialogue plan so as to minimize the estimated number of turns to complete the dialogue. The number of turns is estimated depending on the current speech recognition accuracy and probability distribution of the true user’s request. The proposed method reduces the number of turns to complete the...
متن کاملStochastic Language Adaptation over Time andState in Natural Spoken Dialogue
| We are interested in adaptive spoken dialogue systems for automated services. Peoples' spoken language usage varies over time for a given task, and furthermore varies depending on the state of the dialogue. Thus, it is crucial to adapt ASR language models to these varying conditions. We characterize and quantify these variations based on a database of 30K user-transactions with AT&T's experim...
متن کاملNew Feature Parameters For Detecting Misunderstandings in a Spoken Dialogue System HIRASAWA
This paper describes new feature parameters for detecting misunderstandings in a spoken dialogue system. Although recognition errors cannot be completely avoided with current speech recognition techniques, a spoken dialogue system could be a good human-machine interface if it could automatically detect and recover from its own misunderstandings during natural interaction between it and a user. ...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003